Diversity is All You Need: Learning Skills without a Reward Function
نویسندگان
چکیده
Intelligent creatures can explore their environments and learn useful skills without supervision. In this paper, we propose DIAYN (“Diversity is All You Need”), a method for learning useful skills without a reward function. Our proposed method learns skills by maximizing an information theoretic objective using a maximum entropy policy. On a variety of simulated robotic tasks, we show that this simple objective results in the unsupervised emergence of diverse skills, such as walking and jumping. In a number of reinforcement learning benchmark environments, our method is able to learn a skill that solves the benchmark task despite never receiving the true task reward. In these environments, some of the learned skills correspond to solving the task, and each skill that solves the task does so in a distinct manner. Our results suggest that unsupervised discovery of skills can serve as an effective pretraining mechanism for overcoming challenges of exploration and data efficiency in reinforcement learning.
منابع مشابه
Effective Factors on Children's Selective Trust in Other's Testimony
Children live socially from birth to adulthood, and learning is an integral part of their living. They won’t achieve the knowledge and skills for life without learning. However, childhood period is not lasting enough for learning all of the massive amounts of information and skills required for living in this world as adults and children aren’t able to acquire the whole of knowledge and skills ...
متن کاملP14: How to Find a Talent?
Talents may be artistic or technical, mental or physical, personal or social. You can be a talented introvert or a talented extrovert. Learning to look for your talents in the right places and building those talents into skills and abilities might take some work, but going about it creatively will let you explore your natural abilities and find your innate talents. You’re not going to fin...
متن کاملReinforcement Learning with Human Feedback in Mountain Car
As computational agents are increasingly used beyond research labs, their success will depend on their ability to learn new skills and adapt to their dynamic, complex environments. If human users —without programming skills — can transfer their task knowledge to the agents, learning rates can increase dramatically, reducing costly trials. The TAMER framework guides the design of agents whose be...
متن کاملP25: Talent and Perseverance
Many people think that all you need to succeed at anything is talent but talent alone without perseverance and determination, cannot help you achieve success. Talent is helpful but perseverance ensured one achieves success. A child can show an exceptional talent for storytelling, but if he ignores his teacher’s comments and doesn’t work on his stories, he will never be a great novel...
متن کاملLearning Roles: Behavioral Diversity in Robot Teams
This paper describes research investigating behavioral specialization in learning robot teams. Each agent is provided a common set of skills (motor schema-based behavioral assemblages) from which it builds a taskachieving strategy using reinforcement learning. The agents learn individually to activate particular behavioral assemblages given their current situation and a reward signal. The exper...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1802.06070 شماره
صفحات -
تاریخ انتشار 2018